Dichroic /-sum rule and the orbital magnetization of crystals 
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We consider the magnetic circular dichroism spectrum of a crystal with broken time-reversal sym- 
metry in the electric-dipole approximation. Using the Kubo- Greenwood formula for the absorptive 
part of the antisymmetric optical conductivity, its frequency integral is recast as a ground-state 
property. We show that in insulators this quantity is proportional to the circulation of the occupied 
Wannier orbitals around their centers (more precisely, to the gauge-invariant part thereof). This 
differs from the net circulation, or ground state orbital magnetization, which has two additional 
contributions: (i) the remaining Wannier self-rotation, and (ii) the "itinerant" circulation arising 
from the center-of-mass motion of the Wannier orbitals, both on the surface and in the interior of 
the sample. Contributions (i) and (ii) are not separately meaningful, since their individual values 
depend on the particular choice of Wannier functions. Their sum is however gauge-invariant, and 
can be inferred from a combination of two experiments: a measurement of the magneto-optical 
spectrum over a sufficiently wide range to evaluate the sum rule, and a gyromagnetic determination 
of the total orbital magnetization. 

PACS numbers: 78.20.Ls, 75.10.Lp, 73.43.-f 
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I. INTRODUCTION 

Optical sum rules provide a link between excitation 
spectra and ground-state properties. The best-known 
example is the /-sum rule of atomic physics^ It relates 
the frequency-integrated absorption of linearly polarized 
light to the number of valence electrons. In this work 
we consider the analogous result for circularly polarized 
light. For non-magnetic systems the circular /-sum rule 
is simply the average of the /-sum rules for the two 
linearly-polarized components of the beam, again yield- 
ing the total number of electrons. If, however, the sys- 
tem is magnetized, cither spontaneously or by an applied 
field, this is no longer the case; there is a small correction 
that flips sign when either the magnetization of the sam- 
ple or the helicity of the incident light is reversed. We 
are interested in what information this correction to the 
circular /-sum rule provides about the magnetization. 

The differential absorption of right- and left-circularly- 
polarized light by magnetic materials is known as mag- 
netic circular dichroism (MCD). The object of interest 
in this work can thus be viewed as a "dichroic" /-sum 
rule for the integrated MCD spectrum. Such a sum rule 
was first derived by Hasegawa and Howard for the spe- 
cial case of a hydrogen atom in a magnetic field^ They 
showed that it is proportional to the quantum-mechanical 
expectation value of the orbital angular momentum op- 
erator, i.e., to the orbital moment. It has been assumed 
that this conclusion generalizes trivially to many-electron 
systems such as solids^^ This is not so^ as shown by Op- 
peneer, who obtained the correct sum rule for that case£ 
He observed that it yields a quantity that is subtly dif- 
ferent from the orbital magnetization M or b, and should 
instead be viewed as one of two terms adding up to M or b- 

In a separate development, a rigorous theory of orbital 
magnetization in crystals was recently formulatedi 7 -'^'^^ 



Interestingly, it also identifies two separate contributions 
to M or b- One key result of the present work is to recast 
the dichroic /-sum rule in the language of this modern 
theory, elucidating its physical content. Conversely, the 
sum rule solves an open problem in the theory of Refs. 7, 
[slJoiriol as raised explicitly in Ref. whether the two 
gauge-invariant contributions to M or b identified therein 
are separately measurable in principle. The present work 
answers this question in the affirmative. 

Although we will mostly focus on crystalline solids, we 
find it useful to start in Sec. |TT] by discussing the sum 
rule in the more general context of bounded samples un- 
der open boundary conditions. The detailed treatment of 
periodic crystals is deferred until Sec. IIIII In both cases, 
special emphasis will be placed on insulating systems, for 
which an intuitive real-space picture in terms of occupied 
Wannier orbitals can be given. We conclude in Sec. IIVI 
with a summary and outlook. In Appendices [Al [B] and 
Owe derive and elaborate on some results quoted in the 
main text. In particular, Appendix |A1 discusses the re- 
lation between the dichroic /-sum rule and three other 
known sum rules. 



II. BOUNDED SAMPLES 

A. Preliminaries 

In this work we are interested in systems displaying 
broken time-reversal symmetry in the spatial wavefunc- 
tions. A typical example would be a ferromagnet such 
as iron in which the exchange interaction breaks time- 
reversal symmetry in the spin channel and this symme- 
try breaking is then transmitted to the orbital degrees 
of freedom by the spin-orbit interaction. Other exam- 
ples include systems in applied magnetic fields, and also 
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certain spinlcss model Hamiltonians such as the Haldanc 
models 

We work in the independent-particle approximation. 
The interaction with light will be treated in the electric- 
dipole approximation, valid at not-too-high frequencies. 
This should be adequate provided that the sum rule sat- 
urates before higher-order contributions, such as electric 
quadrupole and magnetic dipole terms, become signifi- 
cant. The oscillator strength for the transition between 
one-electron states n and m is 
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This expression, valid for a general polarization e of light, 
can be derived in the same wav-i as the familiar oscillator 
strength formula for linear polarization e = x. For light 
propagating along z with circular polarization 
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( "+" corresponds to positive helicity, or left-circular po- 
larization), 



f(±) = L\f> + f> 1 =r f» 

Jn — >m 2 \_ J nm,xx 1 •lnm,yy] ' J nm,xyi 

where we have introduced the matrix 
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Here a, (3 label Cartesian directions, huj mn — £ m — £ n , 
and m e is the electron mass. Note that the matrix 
/ = /' + if" is Hermitian in the Cartesian indices. Thus 
its real and imaginary parts are symmetric and antisym- 
metric respectively. 

According to Eq. the sum of the oscillator 

strengths for the two circular polarizations e + and e 
equals the sum of the oscillator strengths for the two 
linear polarizations x and y, and is related to /'. The 
circular dichroism, i.e., the difference between the two 
circular oscillator strengths, is given by /": 
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Consider now a macroscopic system (e.g., a sample of 
volume V cut from a bulk crystal) and decompose its 
optical conductivity cr Q ^(o;) in three different ways: (i) 
real and imaginary parts, a' and a"; (ii) symmetric and 
antisymmetric parts, cts and a a', (hi) Hermitian and anti- 
Hcrmitian parts, cth and ctah- Then 



and 



oh 
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where the Cartesian indices and the frequency have been 
omitted. The properties of a can be summarized by not- 
ing that the Hermitian part is dissipative while the anti- 
Hermitian part is reactive, and the symmetric part is 



"ordinary" while the antisymmetric part is "dichroic." 
At T = the dissipative (or absorptive) part is 
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where — e is the electron charge. The analog of Eq. © 
in terms of conductivities is 



1 



(9) 



Thus the difference in absorption between light with neg- 
ative and positive helicity is given by twice the imaginary 
part of the antisymmetric optical conductivity^ 



A,xy 



abs 
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(10) 



Like other magneto-optical effects, MCD vanishes for 
time-reversal-invariant systems. This can be seen from 
the Onsagcr relation a a p(H, M) = <rp a (— H, — M), which 
implies cr A ,a/3(H = M = 0) = 0. 



B. Dichroic /-sum rule 



With the notation 



(/) EE / f(t0)dw, 
JO 



(11) 



the dichroic /-sum rule relates the integrated MCD spec- 
trum ((7^ a g) to a certain ground-state property of the 
system. To see how, we begin by expressing a A aj3 (uj) as 
the imaginary part of the Kubo-Grecnwood formula ((8]). 
Combining with Eq. (j4|) and taking the integral, 



7T(:' 
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Using the identity 
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(12) 



(13) 



and defining the projector onto the empty states Q = 

EZ pty H(m\, 



7re 
2W 



Y^(n\r a Qvp\n)-{a^p). (14) 



Introducing the pseudo-vector ct a = (l/2)e a( g 7 cr A Q/3 

and P = J2m C l n )( n l' this can be written more concisely 
as 



ne 
'2W 



Tr[Pf x Qv] 



(15) 
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Eq. (|T5|) is the dichroic /-sum rule, also obtained in 
Ref. @. Using the closure relation Q = 1 — P, it be- 
comes apparent that the right-hand-side depends exclu- 
sively on the occupied states, and is closely related to 
the total (macroscopic) ground-state orbital magnetiza- 
tion M or b = 7Tr[Pr x v], where 7 = — (e/2cV) in elec- 
trostatic units (esu). Writing 

M orb = M| + AM, (16) 

with 

M« = 7 Tr[Pr x Qv] (17) 

and 

AM = 7 Tr[Pr x Pv] (18) 
(the notation will be explained shortly), Eq. (|15[) becomes 

(*>{) = ™Mg. (19) 

Hence the sum rule yields an orbital quantity Mg 1 ^ with 
units of magnetization, but differing from the actual or- 
bital magnetization by the remainder AM. 

Two of the three quantities in Eq. (|16p are indepen- 
dently measurable. The left-hand side can be determined 

from gyromagnetic experiments , 13 ' 14 ! 15 while on the 
right-hand side is obtainable from magneto-optical ex- 
periments via the sum rule. Thus, their difference AM 
can also be determined in principle. However, measuring 
Mgg and AM independently will be of only limited in- 
terest unless some physical meaning can be attached to 
each of them separately. With this goal in mind we shall 
now make contact with the recent theory of macroscopic 
orbital magnetization. 

C. Relation to the orbital magnetization 

The results obtained so far are fairly general. To pro- 
ceed further we specialize to insulating samples. For 
the present purposes "insulating" means that the ground 
state wavcfunction can be written as a Slater deter- 
minant of well-localized orthonormal molecular orbitals 
\wi), which we will gcnerically refer to as Wannier func- 
tions (WFs) even when the sample does not have a crys- 
talline interior. 1 ^ This definition encompasses a broad 
range of systems, both macroscopic and microscopic, but 
it excludes metals and Chern insulators^ which are not 
Wannier-representable in the above sense. 

By invariance of the trace, the orbital magnetization 
can be expressed in the Wannier representation as 

occ 

M orb = 7 ^(u; 4 |f x v\ Wi ). (20) 

i 

In Ref. @ this was decomposed asi& 

M orb = M SR + Mic (21) 



where 

occ 

m sr = 1 j2 ( w ^-^) ( 22 ) 

i 

arises from the circulation of the occupied WFs around 
their centers — (wi\r\wi) = r& ("self-rotation"), while 

occ 

Mic = X] *i x ( w i\v\wi) (23) 

i 

is the circulation arising from the motion of the centers 
of mass of the WFs. 

It is well known that the WFs of a given system are 
not uniquely defined; unitary mixing among the WFs 
is allowed, giving rise to a "gauge freedom" (not to be 
confused with the freedom to choose the electromag- 
netic gauge). In practice one deals with this issue by 
choosing, among the infinitely many possible gauges, a 
particular one that has certain desirable properties. A 
common strategy is to work in the gauge that mini- 
mizes the quadratic spread of the WFs, producing so- 
called maximally-localized WFs.— Naturally, any physi- 
cal observable (e.g., M or b) is necessarily invariant under 
a change of gauge. This is unfortunately not the case 
for the individual terms Msr and Mic in Eqs. (|2"2"|) - 
(|2"3"|) . which turn out to be gauge-dependent. This is to 
be expected since these quantities do not take the form 
of traces, unlike those in the decomposition introduced 
earlier via Eqs. (fTo) - (fTgl) . 

The two decompositions (JT6l) — (JTHI) and (j2T])-([23]) arc 
not unrelated, however. To see this, we insert the identity 
1 = Q+P at the location of the cross product in Eq. ([22]) 
to obtain 

Msr = M«+M^ (24) 

where Mg 1 ^ is the quantity defined in Eq. (TIT)) (since 
(wi\Q = 0), and 

occ 

M sr = 7(Tr[Pf x Pv] - ]T r, x v.) 

i 

occ 

In this way we have segregated the gauge-dependence of 
Msr to the term Mg 1 ^, isolating a gauge-invariant part 

Mg 1 ^ which turns out to be precisely the quantity defined 
in Eq. (|17|) and appearing in the sum rule (fl9|) . When the 

gauge-dependent self-rotation Mg^' is combined with the 
gauge-dependent itinerant circulation Mic, h forms the 
gauge- invariant quantity AM of Eq. (JT8J) . The relation 
between the decompositions (|16|) - (TT8"|) and (|2"Tj) - (|2"3"|) can 
be summarized by writing 

M orb = M« + Mg£ + Mic • (26) 

AM 
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There is a remarkable parallelism between the decom- 
position of the Wannier self-rotation (f2"2"| and the 
decomposition^ 

n = fli + h (27) 

of the Wannier spread 

occ 

tt = J2 <^l(?-r 4 ) 2 K) (28) 

i 

into a gauge-invariant part 

n 1 = J2 Tr[Pf Q gf Q ] (29) 

a 

and a gauge-dependent part 

occ 

n=^T\r ij f. (30) 

The similarities between Eqs. (JTTJ) and (f2"9")h and be- 
tween Eqs. ([25]) and (|30|) . are striking. Interestingly, the 
gauge-invariant spread Qi is related to the "ordinary" ab- 
sorption spectrum by a second sum rule, as discussed in 
Ref. [l^ and Appendix [AJ In addition, the interpretation 
of f^i as a measure of the quadratic quantum fluctua- 
tions, or "quantum spread," of the many-electron center 
of massif is mirrored by Mgp having the meaning of a 
center-of-mass circulation, as discussed in Appendix [Bj 

First-principles calculations show that for maximally- 
localized WFs, 51 is typically much smaller than f2iJ£ 
Indeed, the minimization of the spread acts precisely to 
reduce £7 as much as possible. In general £1 cannot be 
made to vanish exactly in two or higher dimensions, since 
the non-commutativity of PxP, PyP, and PzP implies 
that the off-diagonal ry cannot all be zero. In prac- 
tice, however, they can become quite small. According 

to Eq. P5|) , Mgg would also vanish if all off-diagonal 
were precisely zero. Hence we expect the self-rotation of 
maximally-localized WFs to be dominated by the gauge- 
invariant part as well^ 

The fact that AM is composed of self-rotation 
and itinerant-circulation parts which arc not separately 
gauge-invariant means that angular momentum can be 
converted back and forth between Msr and Mic via 
gauge transformations. This will be discussed in more 
detail in Sec. IIIIDI here we simply note that the two 
parts are similar in that both originate from the spatial 
overlap between neighboring WFs. This is evident from 
the definition of Mgg , and for Mic it follows from writ- 
ing v; in terms of the "current donated from one Wannier 
orbital to its neighbors" as in Ref. 0- AM can therefore 
be interpreted as an interorbital contribution to M or b, 
even though it includes part of the self-rotation, while 
Mgp is the purely intraorbital portion. (Similarly, fli 
and Q are the intraorbital and interorbital parts of the 
Wannier spread, respectively.) 



III. CRYSTALLINE SOLIDS 



In this Section we apply the general formalism of SecUTl 
to crystalline solids, recasting the relevant quantities in 
the form of Brillouin zone integrals. We start in Sec. lIII Al 
by rederiving the dichroic /-sum rule for Bloch electrons. 
In the remaining subsections we explore the connections 
between this bulk reformulation and the theory of orbital 
magnetization in crystals 

A somewhat unsatisfying aspect of that theory as de- 
veloped in Ref. [H is the lack of consistency in the way 
the orbital magnetization was decomposed, in the follow- 
ing sense. One partition (M or b = Mlc + Mic m their 
notation^) was made for bounded samples, after which 
the thermodynamic limit was taken for each term sepa- 
rately. The resulting fc-space expressions were then com- 
bined to form the total M orD . Finally, working in fc-space, 
a different partition (M or b = Mlc + Mic) was identi- 
fied whose individual terms were gauge- invariant, unlike 
those of the original decomposition. In the process, how- 
ever, the intuitive real-space interpretation of the original 
decomposition was lost, and the separate meanings of the 
two terms in the gauge-invariant decomposition was left 
unclear. 

Here, instead, we shall work from the very beginning 
with the two gauge-invariant terms Mg 1 ^ and AM, which 
afford a simple real-space interpretation in terms of WFs. 
They are first identified for fragments with a crystalline 
interior (crystallites) in Sec. IIII Bl The thermodynamic 
limit of each term is then taken, producing the reciprocal- 
space expressions of Eqs. (|4T)]) - (|4"T)) (the details of the 
derivation can be found in Appendix [Cjl . Interestingly, 

we find that our gauge-invariant terms M s ^ and AM 
differ from - but are simply related to - those of the 
gauge-invariant decomposition of Ref. 0. In the partic- 
ular case of an insulator with a single valence band, on 
the other hand, they reduce exactly to the terms identi- 
fied in Ref. d, as will be discussed in Sec. IIII CI Because 
the work of Ref. d is based on a semiclassical picture of 
wavepacket dynamics, however, it is not easily general- 
ized to a multiband gauge-invariant framework as is done 
here. 

In Eq. (USD of Scc.|TT]thc decomposition AM = M^ + 
Mic for insulating systems was obtained by working in 
the Wannier representation. For insulating crystallites 
Mic can be divided further into a "surface" part M| c ulf - ) 
and an "interior" part Mj™ . The interplay between the 
resulting three contributions to AM will be the focus 
of the final two subsections. Single-band insulators are 
discussed in Sec. IIII CI The general case of multiband 
insulators is considered in Sec. IIII Dt where the gauge- 
transformation properties of those terms is analyzed. 
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A. Dichroic /-sum rule 

The first step is to rewrite the Kubo-Greenwood for- 
mula jS} in a form appropriate for periodic crystals, 
where dipole transitions connect valence and conduction 
Bloch states with the same crystal momentum k. Eq. (|4|) 
becomes, dropping the index k for conciseness, 

f nm ,a0 = - (2m e uj mn /h) (u n \d a u m ) (u m \dpu n ) , (31) 

where d a = d/dk a and we have used the relational 
v nm ,a = u mn (u n \d a u m ) for m / ti, with \u n ) a cell- 
periodic Bloch state. Eq. |(5J) now reads 



<7hM = 



occ empty 

E E fnmSfa -W mn ). (32) 



2m e J (2tt) 3 
Consider the frequency integral of <7u(to 



dk 



2m e J (2tt) 3 



occ empty 

EE/* 



(33) 



The dichroic /-sum rule will be obtained from the imag- 
inary part of this complex quantity, while the real part 
yields the ordinary /-sum rule (see Appendix [SJ . 
Using Eq. (|3 1 [) to expand the summation, 



occ empty 

^ ] ^ ] fnm,af3 



2m e 

IF 
IF 



occ empty 



^ ^ (u n \d a u 7n )(E rn - E n )(u m \dpu n ) 

n m 
occ empty 

E E (daUn\u m )(E n - E m )(u m \df3U n ) 



2m e 

—3- (,3k, a/3 — «k,a/3j 



(34) 



where we have introduced a set of notations as follows: 

occ 

bh.,gp = ^ {d a U n \dpUn), (35) 



and 



3k,a/3 = (aaUnl-ffl^Un), 



= -En {d a U n \df}U n ) 



(36) 



(37) 



The symbol 9 denotes the covariant derivative 
defined as \d a u n w) = Qk|<9 Q u„k), where Qk = 
S^" Pty |wmk)<Wmk|- The imaginary part of 6 k ,a/3 is es- 
sentially the Berry curvature while its real par t is related 
to the quantum metric (Appendix C of Ref.llq; we discuss 



the physical content of 6k,a,3 in Appendix lA]) . Quanti- 
ties gk, a/3 and fok,a/3 are similar to &k,a/3 except that they 
carry an extra factor of Hamiltonian or energy. Note that 
&k,a/3 corresponds to the quantity fw,a/3 in Ref. 0, while 
5k, a/3 and /ik,a/3 are the same as in that work. 
With these definitions Eq. |33|) becomes 

. . ire 2 f dk . 

(CH,a/3) = -^2" J J^Kf (3k,a/3 ~ /lk,a/3j- (38) 



The imaginary part reads, in vector form, 



dk 

h 2 J (2^ 



(39) 



This is the dichroic /-sum rule in the Bloch representa- 
tion. 

We can now compare this result with the decomposi- 
tion obtained in Ref. HI, where the ground-state orbital 
magnetization was partitioned into two gauge-invariant 
terms as 



where 



M orb = M LC + Mic 



~ e f dk .. 

Mlc = — / — gk, 



he J (2tt) s 



Mic = 



dk 



he J (2?r) 



3 K- 



We thus arrive at our main result 

«) = ^(M LC -M IC ) 



(40) 



(41) 



(42) 



(43) 



relating the integrated MCD spectrum to the compo- 
nents of the orbital magnetization. Note that the sum 
rule is proportional to the difference between the gauge- 
invariant contributions of Ref. U By independently 
measuring the sum of Mlc and Mic via gyromagnetic 
experiments^ and the difference via the magneto-optical 
sum rule, the value of each individual term can indeed be 
measured in principle, resolving an open problem posed 
in Ref. ©. 

Strictly speaking, Eqs. (|4"0"|) - (|4"2")) as written are valid 
for conventional insulators only. The generalization to 
metals and Chern insulators is subtle, but the under- 
standing emerging from Refs. iiE3 is that the appro- 
priate generalization is obtained by making the replace- 
ments H — > H — \i and E n — > E n — fi in Eqs. (|36[) and 
(f3"T)) . where [i is the electron chemical potential. Clearly 
5k — and with it the sum rule (|39p . are insensitive to 
these substitutions^ 

Comparing Eqs. (|4"0"|) and (|4"3"| for extended crystals 
with Eqs. (fTo| and (fH)j) for bounded samples, it appears 
plausible that the two partitions (fTB|) and (|4"0|) of M or b 
ought to be related by 



M 



(i) 

SR 



M 



LC 



M 



ICj 



(44) 
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AM = 2Mic, (45) 

or explicitly, 




The correctness of these identities is demonstrated in Ap- 
pendix [C] by taking the thermodynamic limit of results 
derived in the next subsection. 



B. The magnetization of an insulating crystallite 

To gain a better understanding of the bulk expressions 
derived in the previous section, we now specialize the re- 
sults obtained for bounded samples in Sec. Ill CI to the 
case that the sample has a crystalline interior. Work- 
ing in the Wannicr representation, we are then able to 
establish connections between the fc-space and Wannicr 
viewpoints and associate a local physical picture with 
the various terms appearing in the bulk orbital magneti- 
zation. 

Following Rcfs. we divide our crystallite into 

"surface" and "interior" regions. This division is largely 
arbitrary, and it only needs to satisfy two requirements: 
(i) the border between the two regions should be placed 
sufficiently deep inside the sample where the local envi- 
ronment is already crystalline, and (ii) the surface region 
should occupy a non-extensive volume in the thermody- 
namic limit. The Wannicr orbitals spanning the ground 
state are assigned to each region. Those in the interior 
converge exponentially to the bulk WFs |Rn) (R is a lat- 
tice vector), and those on the surface will be denoted by 

We first divide the orbital magnetization into self- 
rotation (SR) and itinerant-circulation (IC) contribu- 
tions according to Eqs. (|2Tj) - (f23| . In the thermodynamic 
limit the SR part, which only involves the relative coordi- 
nate f — fj, is dominated by the interior region. Invoking 
translational invariancc, 

M SR = 7c [( 0n l f x v|On) - r n x v„] , (48) 

n 

where j c = —e/(2cV c ), with V c the cell volume, r„ = 
(On|f|On), and v„ = (On|v|On). Henceforth summations 
over band-like indices span the valence-band states. 

Next we break down the self- rotation as in Eq. ([2~Xf . 
setting P = ErE„ |Rn)(Rn|: 

M sr = 7cRctr c [Pf x Qv] (49) 



M sr = 7c(Rctr c [Pf x Pv] - r n x v„) 

n 

= 7 C ^ V Re{(On|r|Rm) x (Rm|v|On)}. 

n Rm/On 

(50) 

The symbol tr c denotes the trace per unit cell. Note 
that we have taken the real part of the traces explicitly; 
this was not needed in Eqs. (TIT]) and (|25|) for bounded 
samples, where the traces were automatically real. 

Now we turn to the IC term (J23J) in Eq. |[2"T]). Un- 
like Msr, in the thermodynamic limit it generally has 
contributions from both interior and surface regions)^ 

M IC = M I ^ nt) +Mfe urf) . (51) 

The interior part becomes 

M lc t) =7c^rnxv„ (52) 

n 

where it was necessary to use 

J2 v » = ( 53 ) 

n 

when exploiting the translational invariancc. Eq. (|53|) 
expresses the fact that no macroscopic current, or dy- 
namic polarization,— flows through the bulk in a sta- 
tionary state. Because of this constraint, the quantity 
([52")) necessarily vanishes for insulators with a single va- 
lence band. In multiband insulators it takes the form 
of an intracell itinerant circulation: the WF centers in 
each cell can have a net circulation while their collective 
center-of-mass remains at rest. 

Finally, the surface contribution is 

N s 

Mfe urf) =7£f s xv s . (54) 

s=l 

It was shown in Rcfs. 0and| that in the thermodynamic 
limit this can be recast as 

M (surf) = _ry c j m ^ Rx(mO|f|nR)(nR|iT|mO), (55) 

mtiR 

where j c = j c /h. This result is remarkable in that it 
expresses a circulation in the surface region solely in 
terms of matrix elements between the interior WFs, in 
a way that does not depend on the precise location of 
the boundary between the two regions (provided that 
the boundary satisfies the two criteria mentioned earlier) . 
We emphasize that it holds for crystalline insulators only. 
Whereas M^f } is an intracell-like term, in the bulk 

form (53 M{ s c urf) is seen to have an intercell character, 
vanishing in the "Clausius-Mossotti" limit of zero over- 
lap between WFs belonging to different cells. The as- 
signment of the bulk WFs to specific cells is however 
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not unique, and by making a different choice it is possi- 
ble to convert between "intracell" Mj™ and "intcrccll" 
■jyj-(suif)^ ^ s other reasons to be detailed in 

Sec. IIII D[ the interior and surface parts of M or b are in 
general not physically well-defined, even in crystalline in- 
sulators. Collecting terms, the full orbital magnetization 
reads 

M orb = M« + M^+Mj^+M^ , (56) 

AM 

which is similar to Eq. (|2"6"|) except that the IC term has 
been separated into interior and surface parts. 

This Wannier-based decomposition of the magnetiza- 
tion of a crystallite follows closely that of Rcf. [{j. Two 
differences are worth noting. First, we have emphasized 
the distinction between Wannier self-rotation and itiner- 
ant circulation. In Ref. the emphasis was more on the 
separation between the surface contribution M|™ rf ' ) (de- 
noted by Mic in that work) and the interior contribution 
Mlc = 7ctr c [Pf x v] containing the net magnetic dipole 
density of the WFs in a crystalline cell. This "local cir- 
culation" includes all of the self-rotation as well as the 
intracell part of the itinerant circulation. In the present 
notation the decomposition of Ref. reads 

M orb = Mgp + + M^ nt) +Mj™ rf) . (57) 



Note that for one-band insulators Mj c = 0, in 
which case the interior contribution coincides with the 
self-rotation, and the surface part with the itinerant 
circulation.— Second, by identifying a gauge-invariant 
part of the self-rotation, we have been able to orga- 
nize the four resulting terms into the two gauge-invariant 
groups indicated in Eq. (|56|) . 

The present viewpoint appears to be more useful for 
arriving at a simple physical picture for the sum rule. It 
has the additional advantage of being applicable to disor- 
dered and microscopic systems, for which the distinction 
between interior and surface contributions loses meaning. 



C. One-band insulators 

We begin our discussion of M or b in insulators with a 
single valence band by considering the remainder AM. 
We saw in Sec. IIII Bl that. of the three terms into which it 
is naturally decomposed in the Wannier representation, 
one of them vanishes if there is only one WF per cell, 

M$f 5 = 0. (58) 
Remarkably, the two surviving terms become identical, 

Mg = M« = ^ (59) 



and thus individually gauge- invariant. This follows from 
Eqs. (|44|) - ((45|) in the one-band limit. Indeed, the quan- 
tities Mlc and Mic therein were defined in Ref. [pj in 
such a way that for one-band insulators they reduce to 
the quantities M LC = M S r and M|™ rf) in Eq. ([ST|) . It is 
then seen that Eqs. (|44p and (|45[) correspond to the first 
and second equalities in Eq. (f59|) respectively. We em- 
phasize that Eqs. ([5"8 ]) -(f5"9" l) only hold for crystalline WFs 
which respect the full translational symmetry of the crys- 
tal. If, for instance, a larger unit cell is used (effectively 
folding the Brillouin zone and turning the system into a 
multiband insulator), the additional gauge freedom can 
be used to construct WFs for which Eqs. ([55|) - ([55]) no 
longer hold. 

Consider now the full orbital magnetization. For one- 
band insulators the reciprocal-space expressions (146[) 
(l4"Tf reduce to 

M s« = 4r c lm Jwf a " k| X ^-E^u*) (60) 
and 

p f dk 

AM = — J — £ k Im(d kUk | x lft.Uk). (61) 

Their sum M or b is given by the right-hand-side of 
Eq. (f60|) with — i?k replaced by +i?k, which is the ex- 
pression originally obtained in Rcfs. and S More- 
over, the individual contributions M g p and AM coincide 
with those identified in Ref. |S|. Instead, the derivation of 
Ref. leads to the alternative - but, for one-band insula- 
tors, also gauge- invariant - partition into the "interior" 

and "surface" parts M LC = M S r = M^ + AM/2 and 
Mj™ rf) = AM/2. 

While the individual terms M g ^ and AM agree, for 
single-band insulators, with those of Ref. H, we interpret 
them somewhat differently here. Eq. ((60)) of Ref. H had 
the meaning of an intrinsic magnetic moment associated 
with the self-rotation of the carrier wavepackets. Accord- 
ing to the present derivation, that term is only part of 
the Wannier self- rotation. As for Eq. (fBTj) , in the deriva- 
tion of Ref. [8[ it was seen to arise from a Berry-phase 
correction to the electronic density of states, and was 
subsequently claimed to be associated with a boundary 
current circulation.— Instead, according to the present 
viewpoint only half of it originates in the itinerant circu- 
lation M|Q Urf ' of the surface WFs, while the other half is 

ascribed to the remaining self-rotation M^ of the WFs 
in the bulk. 



D. Gauge transformations for multiband insulators 

In multiband insulators all three terms Mg^', Mj|? , 

and M{™ rf) can be nonzero. However, their individual 
values arc not physically meaningful, since a gauge trans- 
formation can redistribute the total AM among them. 



In particular, it is interesting to consider gauge trans- 
formations that shift the location of a WF by a lattice 
vector. 

A general gauge transformation takes the forrn^ 



Ktnk) 



E 



U m k}U„ 



(62) 



where Uk is an Nb x Nb unitary matrix in the band in- 
dices. We assume that a transformation of this kind has 
already been applied to transform from the Hamiltonian 
cigcnstatcs at each k to a set of states that are smooth 
in k from which the WFs are to be constructed. We can 
then interpose an additional diagonal gauge transforma- 
tion 



link 



-tkR„ 



\Unk) 



(63) 



where R„ is a real-space lattice vector; this has the effect 
of shifting the location of WF n by R„. For a one-band 
insulator, or if R„ is the same for all bands, this amounts 
to shifting the choice of the "home" unit cell. However, 
in the multiband case different WFs can be shifted dif- 
ferently, corresponding to the freedom in choosing which 
WFs "belong" to the home unit cell. 

For example, Fig. Q] shows four cells of a model two- 
dimensional crystal consisting of "molecular magnets" 
disposed on a square lattice with lattice constant a. Be- 
fore the transformation (|63|). the home unit cell contains 
the four WFs shown in bold in Panel (a). Applying the 
transformation with R2 = ax and R m = for all other 
WFs changes the selection of the "basis" of WFs belong- 
ing to the home cell to be that shown in Panel (b). 

How does this affect the individual terms composing 
AM? Clearly the self-rotation £48]) is not affected. Ac- 
cording to Eq. (|52|) . Mj™^ changes by 7 C R2 x?2. To pre- 
serve the overall invariancc of AM the remaining term 
Mj™ lf ^ must change by an equal and opposite amount. 
Let us see in more detail how this comes about. 

We begin with a formal derivation. The fc-space ex- 
pression for M|™ rf ^ is given by-2 



M 



(surf) 
IC 



Im 

2hc ^ 

mil 



dk 



-Emnk(<9kUnk| X |<9kU mk ) 



(64) 

A few steps of algebra show that under the transforma- 
f ^ changes by 

e x - ; dk 

Yc n X ^./ (2^ 



tion ([Ml) M IC 



■Re{(u n \dkU m }(um\Hk\u n }}- (65) 



Replacing {u n \c\u m ) by — (dkU„ \u m ) allows to identify a 
term Pk-ffk = in the above expression, which becomes 

dk 



he 



(2tt) 3 

Comparing with the Wannier velocity^ 



Re{(un\H k \dku n )}. (66) 
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FIG. 1: Schematic model of a molecular crystal with one 
molecule per cell and four WFs per molecule. The range of 
the orbitals is indicated by the overlapping circles, and their 
center-of-mass velocities v„ are denoted by arrows. The two 
panels show in bold two possible choices of "Wannier basis." 



and setting n = 2 then produces the desired result 
— 7 C R2 x V2 for the change in Mj^ 1 "^. 

Coming back to the example in Fig. [TJ the intramolec- 
ular orbital overlap gives rise to the nonzero velocities 
v n indicated by the arrows. With the choice of Wannier 
basis of Panel (a) , the collective circulation of the Wan- 



nier centers in each cell results in a finite M 

r (surf) 



(int) 
IC 



while 

from Eq. (|55|) Mj^ urt ' vanishes, since there is negligible 



l ic 

intercell overlap. When the configuration of Panel (b) is 

chosen, M|^ urf ^ becomes — 7 C R2 X v 2 . From the present 
viewpoint this nonzero value is made possible by the in- 
tramolecular (but now intercell) overlap between the sec- 
ond WF of each cell with WFs one, three, and four from 
the cell shifted by R2. 



To view M 



(surf) 
IC 



as a surface contribution rather than 



a bulk intercell term, we consider now a finite sample of 
the same crystal (Fig. [2]), which has been divided into 
surface and interior regions. In deciding which WFs are 
"interior-like" and which are "surface-like" we shall re- 
quire that all WFs assigned to the same cell must belong 
to the same region. If the Wannier basis of Fig. [lja) is 
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FIG. 2: A finite sample cut from the bulk crystal of Fig. \T\ 
With the choice of Wannier basis of Fig. [T{a), "interior" and 
"surface" WFs lie inside and outside the dashed line respec- 
tively; with that of Fig. QJb), they are denoted by solid and 
open circles respectively. Right panel: open and solid arrows 



show the extra "itinerant currents" (MjJ, urf ^ 
spectively) associated with the latter choice. 



and MJc* 5 re 



used, the surface region can be chosen to comprise the 
outermost layer of molecules, so that the border between 
the two regions is given by the dashed line. The four WFs 
on each molecule form a unit with some internal IC circu- 
lation but zero center-of-mass velocity. The total sample 
magnetization is the sum of all such internal circulations, 
which in the large-sample limit is interior-dominated, so 
that M| s c urf) -> 0. 

If the Wannier basis of Fig. |ljb) is chosen instead, the 
upper and lower surface regions are still composed of the 
outermost layer of molecules. However, the left surface 
now contains, in addition, one WF from each molecule in 
the second layer. Those lone surface WFs carry a down- 
ward particle "IC current" which extends along the left 
surface and is indicated by an open arrow on the right 
panel. A corresponding IC current appears on the right 
surface, and together they yield M|™ lf ^ = — (e/2ca)|v2|z, 

which agrees with the result M[^ urf ' = — 7 c R-2 x V2 found 
earlier using a purely bulk argument [in this example 
7 C = — e/(2ca 2 )]. A change of gauge should not change 
any physical quantity, such as the actual current flow- 
ing on the left surface. Since it appears to change by 
adding the open arrow, there must be another equal and 
opposite contribution (the adjacent solid arrow). This 
contribution is the "interior" IC current carried by the 
remaining three WFs (filled circles) on the molecules of 
the second layer. 

The situation just described is reminiscent of the 
"quantum of polarization" in the theory of dielectric 
polarization^ where a change of Wannier basis like that 
leading from Fig. QJa) to Fig. QJb) shifts the polarization 
by a quantum and also changes the surface charge by one 
electron per surface cell area. This might suggest that the 
full gauge invariance of the interior and surface parts of 
M or b discussed in Sec. IIII Cl for single-band insulators be- 
comes, in multiband insulators, a gauge-invariance mod- 
ulo 7 C R„ x v„. While true for this particular example, 
this is generally not so3i Even for this model it will cease 



to be true as soon as the molecules start overlapping sig- 
nificantly. When this happens, the value of Mj™ 1 ^ can 
be tuned continuously using other types of gauge trans- 
formations, e.g., the continuous diagonal transformation 



"nk) 



\Unk) 



(68) 



with 6*„.k+G = #nk- This produces a change in M^" 1 ^ 
given by Eq. (f6"o) with R„ therein replaced by a factor of 
— (?k#nk m the integrand. Since both r„ and v ra remain 
invariant (the former was shown in Ref. |2l| and the latter 
follows from Eq. (|53p together with the fact that all other 

v m arc unaffected), so does Mj^. The change in Mj^ 

must therefore be absorbed by M^'. 

To summarize, the transformation (|63j) transfers dis- 
crete amounts of itinerant circulation between the inte- 
rior and surface regions, while the transformation (|68[) 
converts continuously between interior self-rotation and 
surface itinerant circulation. Finally, under the most gen- 
eral transformation (|62p all three gauge-dependent terms 
in Eq. (|56|) can be affected simultaneously, so that only 
their sum AM is unique and physically meaningful. 



IV. SUMMARY AND OUTLOOK 

We have presented an exact sum rule for the MCD 
spectrum, elucidated its physical interpretation, and re- 
lated it to the recent rigorous formulation of orbital mag- 
netization in crystals. In insulating systems the sum 
rule probes the gauge-invariant part of the self- 

rotation of the occupied Wannier orbitals. The total or- 
bital magnetization has a second, less obvious contribu- 
tion AM, arising from the overlap between neighboring 
WFs. It comprises both self-rotation (SR) and itinerant- 
circulation (IC) parts in proportions which depend on 
the precise choice of WFs, while AM itself has a unique 
value. Although the intuitive interpretation in terms of 
the occupied WFs is restricted to Wannier-representablc 
systems such as conventional insulators, the terms 
and AM are in fact well-defined for all electron systems, 
including metals and Chern insulators. 

The practical importance of the sum rule is that it 
allows to break down M or b into physically meaningful 
parts, using a combination of gyromagnetic and magneto- 
optical measurements. This should provide valuable in- 
formation on the intraorbital (or localized) versus interor- 
bital (or itinerant) character of orbital magnetism. For 
example, it has been suggested (Ref. H?! Appendix B) 
that the anomalously large g-factors of Bi might be 
caused by itinerant circulations very much like the ones 
discussed here. On the basis of the present work it should 
now be possible to test this conjecture. 

In the last decade and a half a sum rule for the X-ray 
MCD spectrum^ has been used extensively to obtain site- 
specific information about orbital magnetism in solids. 
The resulting orbital moments have been compared with 
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gyromagnetic measurements of M or b^ If a significant 
itinerant contribution AM is present, one may expect 
a discrepancy between the XMCD orbital moments and 
the total M or b inferred from gyromagnetics. It would 
therefore be of great interest to find such systems defying 
the conventional wisdom about the connection between 
the MCD spectrum and orbital magnetization. 

The ideas discussed in this work should be most rel- 
evant for materials displaying appreciable orbital mag- 
netism and, in particular, appreciable interorbital effects 
which might enhance the ratio |AM/MgpJ. These cri- 
teria do not favor band ferromagnets. First, their or- 
bital magnetization tends to be relatively small. In the 
transition metal ferromagnets Fe, Co, and Ni, for ex- 
ample, it accounts for less than 10% of the spontaneous 
magnetization^ (For comparison, the field-induced or- 
bital magnetization of the d paramagnetic metals can be 
as large as the induced spin magnetization. This has 
been established both from gyromagnetic experimental 
and from first-principles calculations^) Secondly, ferro- 
magnetism is favored by narrow bands and localized or- 
bitals, for which interorbital effects are expected to be 
relatively minor. Finally, the spin-orbit-induccd M or b of 
ferromagnets is believed to be an essentially atomic phe- 
nomenon! largely confined to a small core region close 
to the nucleus ; 30 ' 31 and one might therefore expect AM 
to be small. Paramagnets and diamagnets, with rela- 
tively wide bands (e.g., the s-p metals and semiconduc- 
tors) and additional contributions to M or b unrelated to 
spin-orbit, therefore appear to be more promising can- 
didates. Among ferromagnets, the "zero magnetization 
ferromagnets,"— whose orbital magnetization is so large 
as to cancel the spin magnetization, might be particularly 
interesting. 

An important direction for future work is to carry out 
first-principles calculations of and AM for real ma- 
terials. Such calculations would test the validity of the 
assumption that orbital magnetism in solids is atomic- 
like in nature. 30 ' 31 While plausible, that assumption was 
made in the past partly out of practical necessity, since 
a rigorous bulk definition of M or b in terms of the ex- 
tended Bloch states was not available. Confronting ex- 
periment with a full calculation of M or b within spin- 
density functional theory (SDFT), including the itiner- 
ant terms, would clarify whether SDFT can adequately 
describe orbital magnetism in solids, or whether an ex- 
tended framework (e.g., LSD+U including "orbital po- 
larization" term a 30 ' 31 or current- and spin-density func- 
tional theory^) is needed. 

In Appendix A we place the dichroic /-sum rule in the 
broader context of other known sum rules. We note in 
particular that by taking different inverse-frequency mo- 
ments, the interband MCD spectrum can be related to 
two other phenomena resulting from broken time-reversal 
symmetry, namely the ground state orbital magnetiza- 
tion and the intrinsic anomalous Hall effect. These are 
generally expected to coexist, and this is indeed the case 
for ferromagnets, where all three occur spontaneously. 



In the case of Pauli paramagnets, however, the intrinsic 
Hall mechanism of Karplus and Luttinger has received 
little if any attention. On the other hand, it is known 
that Pauli paramagnets can display a field-induced MCD 



spectrum 



33.34 



This raises the question as to what role 



the Berry curvature may play in their "ordinary" (field- 
induced) Hall effect. Such a "dissipationless" contribu- 
tion is undoubtedly present in principle by virtue of the 
sum rule (|A7[) . First-principles calculations of this effect 
will be presented in a future communication^ 

To conclude, we have described the orbital magnetiza- 
tion of crystals in terms of localized (Mgg ) and itinerant 
(AM) parts, and shown how to relate them to magneto- 
optical and gyromagnetic observables. This should allow 
one to probe more deeply into the nature of magnetism 
in condensed-matter systems than previously possible. 
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APPENDIX A: OTHER SUM RULES 

In this Appendix we derive three additional sum rules 
for Bloch electrons and discuss their relation to the 
dichroic /-sum rule. All four involve inverse-frequency 
moments (w~ p cth) [in the notation of Eq. (TTTj) ] of the 
absorption spectrum (JSJ). They are given by p = and 
p = 1, and in each case two sum rules are obtained by 
taking the real and imaginary parts: one ordinary and 
the other dichroic, respectively. 

We first consider p = 0. From the imaginary part of 
Eq. ([55)) we obtained the dichroic /-sum rule (f4"5)) . To 
discuss the real part we revert from (|38|) to the form 



•i „ n occ empty 

2m~J (2^)S^ ^ ■ tnm ' a 



'S,a0/ 



(Al) 



SillCe fnm,a{3 — [,/*m7i,a:/3] •> 

occ empty occ 

^ ^ ^ ^ f nm,a.p ^ ^ ^ ^ fnm,a.{3 
n m n m^n 



E 



Saf 



e / n,a/3 



,(A2) 



where the second equality is the effective-mass theorem. 
Hence we find 



7re 



dk 



2m e J (2tt) 3 



E 



o a p — I 



e / n,a/3 



(A3) 



the modified f -sum rule**- for the ordinary spectrum. 
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To obtain the two sum rules for p = 1 we again start 
from Eq. l[3"2"]). but now replace Eq. ([M]) by 



occ empty 

EE 



fnm,a[3 2,Ul e 



where b n , a p was defined in Eq. (|35|) . Thus 

= 2 r dk . 



(w er H ) 



7re 



(2tt) 3 



(A4) 



(A5) 



For the dichroic part, noting that flu = — 2b^ is the 
Berry curvature summed over the occupied states at k, 
and comparing with the "intrinsic" Karplus-Luttingcr 
Hall conductivity^! 



, . , e 2 f dk 

^ = o) = --y _ 



one finds the Hall 



rule, 



(lj- 1 *" 



J = 2°a(w = 0). 



(A6) 



(A7) 



This is the co — ► limit of the Kramers-Kronig relation 
for the antisymmetric conductivity^ Since only the m- 
terband part of the optical conductivity was included on 
the left-hand-side, the intrinsic dc Hall conductivity was 
obtained on the right-hand-side. Extrinsic contributions 
to the latter (e.g., skew scattering) presumably arise from 
intraband terms in the former. 

Finally consider the ordinary (real) part of Eq. (|A5|) . 
The quantity b' n a p is the quantum metric J£ It is related 
to the localization tensor K a p of insulators by2£ 



A Q fl = — 



dk 

( 27r )3°W. 



(A8) 



where N/V is the electron density. Hence we recover the 



electron localization 



rul 



,19 



(lj 1 a! 



S,a[3/ 



ne 2 N 

hv 



-A 



a/3- 



(A9) 



In summary, we have in Eqs. (|38[) and (|A5[) two gen- 
eral sum rules for the zero-th and first inverse frequency 
moments of the optical absorption, respectively. Tak- 
ing imaginary and real parts of (|38p gives the dichroic 
/-sum rule (|43|) and the modified ordinary /-sum rule 
(|A3[) , while taking imaginary and real parts of (|A5|) gives 
the Hall sum rule (|A7[) and the electron localization sum 
rule (|A"9|l . 

Besides emerging from a unified formalism, the four 
sum rules display certain similarities. For instance, it 
will be shown in Appendix [B] that the dichroic /-sum 
rule yields the expectation value of the many-electron 
center-of-mass circulation operator, while the trace of 
the localization tensor yields the spread of the center- 
of-mass quantum distribution. Moreover, in a one- 
particle picture each quantity can be viewed as the 



gauge-invariant part of the corresponding property (self- 
rotation or spread) of the Wannier orbitals, as discussed 
in Sec. Ill CI for bounded systems. There is however one 
important difference between the behavior of the two 
quantities in the thermodynamic limit. While the center- 
of-mass circulation remains well-defined for metals, the 
trace of the localization tensor is only meaningful for in- 
sulators, diverging in metals i 19 ' 38 Interestingly, the de- 
localization of electrons in metals is also responsible for 
a correction to the /-sum rule. Contrary to the canoni- 
cal /-sum rule for atoms*! the modified /-sum rule (|A3|) 
does not yield the number density of valence electrons in 
a metal, due to the presence of the last term on the right- 
hand-side. This term appears because the Bloch states 
are extended and do not vanish at infinity^ The fact 
that the correction term nevertheless vanishes for insula- 
tors is a consequence of the localized nature of insulating 
many-body wavefunctions in configuration spaced 

We conclude by noting that Eq. (|A7|) provides an ex- 
treme example of how sum rules from atomic physics can 
change qualitatively when applied to extended systems. 
Indeed, the corresponding sum rule for bound systems 
produces a vanishing result^ 



— lmTi[Pf a Qfp\ 



0. 



(A10) 



In contrast, the bulk formula (|A7j) produces for Chern 
insulators a quantized Hall conductivity, and it also de- 
scribes the intrinsic anomalous Hall conductivity of fer- 
romagnetic metals^ This apparent contradiction high- 
lights the subtleties associated with the process of 
taking the thermodynamic limit and switching from 
open to periodic boundary conditions for non-Wannier- 
representable systems. Such issues are still not fully re- 
solved in the theory of orbital magnetization. While a 
general derivation of the bulk formula for M or b has been 
given working from the outset with a periodic crystal^ 
derivations which start from finite crystallites and take 
them to the thermodynamic limit (Refs. and and 
Appendix [Cj are presently restricted to conventional in- 
sulators. 



APPENDIX B: DICHROIC /-SUM RULE AND 
THE MANY-BODY WAVEFUNCTION 



In the main text we interpreted the dichroic /-sum 
rule, and the associated decomposition (fTB]) of M or b, in 
an independent-particle picture based on WFs. It is also 
possible to relate these quantities directly to properties 
of the many-electron wavefunction, without invoking any 
particular single-particle representation. In preparation 
for that, let us first discuss a one-electron system (e.g., a 
hydrogen atom in a magnetic field). Its absorption spec- 
trum is composed of sharp lines, and is more conveniently 
described in terms of an oscillator strength rather than 
an optical conductivity. Taking the imaginary part of 
Eq. and using the relation (|T5|) to replace one of the 
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velocity matrix elements, 

fnm,ap = ~[{n\f a \m)(m\iip\n) - (a <-> /?)]. (Bl) 

Summing over m ^ n and using the closure relation to- 
gether with (n|v|n) = one finds, in vector notation, 

E f4' ro = - ! ^H?xv|n). (B2) 

This is the original dichroic /-sum rule of Hasegawa and 
Howard^ with the orbital angular momentum appearing 
on the right-hand-side; in the notation of Sec. Ill Bl it reads 
(cr'l) = (nec/h) M orb (since here AM = 0). 

We now generalize the discussion to iV-electron sys- 
tems. In this context r = Y^,i=i *i an d v = J2iLi ^i, an d 
it is crucial to make a distinction between the one-particle 
operator A^) = YliLi x and the two-particle opera- 
tor f xv appearing in Eq. (|B2[) . as emphasized in 
Ref. |U The former is related to the electronic angular 
momentum and orbital magnetization, while the latter is 
related to a many-electron center-of-mass circulation. (In 
the classical context, for example, a pair of electrons or- 
biting 180° out of phase in the same circular orbit would 
have AW ^ but = 0.) 

The derivation of the dichroic sum rule for the N- 
electron case proceeds as before, except that the velocity 
matrix elements in Eq. ^ become v nm = (\f r ri |v|\f r . m ), 
where |\l/ m ) are now many-body cigenstates. The result 
is still given by Eq. (|B2[) . with \n) replaced by \^ n )- In- 
deed, it is natural to define the many-body generalization 
of Eq. (H7J) as 

M« ^ 7 (*„|A( 2 >|vI> m ) (B3) 

so that Eq. continues to hold. From this many-body 
perspective the difference AM with respect to the full 
M or b is seen to arise from the cross terms Ylij^i *i x Vj 
in A^-AW. 

To recover from (|B3[) the independent-particle expres- 
sion p7[) we specialize to the case where \^ m ) is a single 
Slater determinant. In second-quantized notation f = 
£« r ii c l c i> v = Eij vycjcj, and |* ) = c\ . . . cj^O), 
where i and j label orthogonal one-particle states. Then 
Eq. (|B3[) becomes 

- 7 ((0|cjv Wlci) x (B4) 

x (E vwcic,)(4---4|0»- (B5) 

Terms in which the indices do not pair can imme- 
diately be eliminated. Furthermore, pairings of the 
form (k = I, i = j) give no contribution, since this 
leads to (X)° CC r u) x (Y^k C ° v kk) which vanishes because 
(^olvl^o) = 0. The only surviving terms are those with 



(j = k,i = l), yielding 

m sr = 7E r ^ x Vji({*o\cj4)(<$Ci\* )) 

occ empty 

■ E ' y a0- - n ^ n * " E E r -..' ' 

(B6) 

where n,; is the state occupancy. Clearly the expression 
on the right-hand-side is equivalent to that in Eq. (fT6]) . 

A similar analysis can be made for the other sum rules 
presented in Appendix lAl For example, the counterpart 
of the Hall sum rule for a bounded many-electron system 
reads 

E— = -^<*o|rxf|tt o )=0, (B7) 

which was termed in Ref. [H the Kuhn sum rule. The 
independent-particle form (|A10[) can be recovered from 
(|B7|) along the lines of Eqs. (|B4)) - ([B6)) . As for the elec- 
tron localization sum rule, it yields the second cumulant- 
momcnt of the quantum distribution of the many- 
electron center-of-massJ^ In the independent-particle 
limit this reduces to k a p — (l/N)Ti{Pr a Qrp], whose 
trace yields the gauge- invariant WF spread (|29|) . The 
bulk formula (|A8[) for insulating crystals can be recov- 
ered in the thermodynamic limit following the strategy 
described below for the orbital magnetization. 

APPENDIX C: THERMODYNAMIC LIMIT 

In this Appendix we start from the expressions (JT7J) 
and l|18p for Mg 1 ^ and AM of insulating crystallites and, 
by taking the thermodynamic limit in the Wannier rep- 
resentation, turn them into the reciprocal-space expres- 
sions flmj and ([37). 

Before proceeding, recall that the quantities gk and hk 
entering Eqs. l[Ig |) -([37 |) were defined in Eqs. (fST j) 
in the context of the "Hamiltonian gauge" in which n 
labels a Bloch energy eigenstate. Here, we work with 
a generalized Wannier representation as in Sec. IIII Bl 
where n labels a Wannier function and |i£ n k) is the state 
of Bloch symmetry (generally not an energy eigenstate) 
constructed from that Wannier function^ The two rep- 
resentations arc related by a fc-dcpendcnt unitary rota- 
tion as in Eq. (|62p . Then Eq. (|36|) remains valid in the 
present context, since it already takes the form of a trace, 
while Eq. ([57)) is now replaced by 

hk.afj = E ^«mk(9 a Mmk|9/3M„k), (CI) 
n m 

where E nmk = (u nk \H k \u m k) ■ With g k and h k written 
as traces in this way, it is evident that each is a gauge- 
invariant quantity^ 
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1. Gauge-invariant self-rotation 

For insulating crystallites in the thermodynamic limit, 
Eq. (fTf)) can be replaced by Eq. (|49|) . Thus we need 
to establish the equivalence between Eqs. (|4"9"| and P5)l . 
Using 



(C2) 



and specializing to the z-component of Eq. 



M, 



(i) 

SR,z 



Im tr c [PxQHQy - PHPxQy] . (C3) 



The second term above may be expanded as a trace in 
the Wannier representation as 



tr c [PHPxQy] = ^ ^(Om|i?|Rn) (Rn|xQy|Or 



R ran 

Then using the identities 



(C4) 



dk 



(Om\H]Hn) = V c / e - lk - R P mnk , (C5) 



^3 e ik - R (a^„ k |a yUmk ), (ce) 

we obtain 

ytx c [PHPxQy] = J (C7) 

Using a similar argument, it follows that 

I f t/k 

-tr c [PxQPQy] = y T^s^a/J. (C8) 



Combining the above two equations with Eq. (|(J3|1 then 
yields Eq. ([35]). 



2. Gauge-invariant remainder AM 

To take the thermodynamic limit of AM we start from 
Eq. (fT8|) and apply it to a large crystallite to arrive at 
Eq. (|47p . Focusing on the ^-component, 

(AM), - 7 Tr[P.TQ{y - 7 Tr[PyQ{; ;c ] . (C9) 

Now use Eq. (|C2[) to obtain 

(AM) Z = -2vfTr[xPyPHP - yPxPHP], (CIO) 

where we defined 7 = and replaced Pi? by the 
more symmetrical form PHP. Using — iTt[(D — Cft] = 
2ImTr[C>], this becomes 

(AM), = 47TmTr[£Py(PPP)]. (Cll) 



At this point we are still considering a bounded sam- 
ple. To obtain a bulk expression we first need to manip- 
ulate Eq. (jCllj) into a form where the unbounded oper- 
ators x and y are sandwiched between P and Q, as in 
Eq. (|C3|) . That ensures that ill-defined diagonal position 
matrix elements between the extended Bloch states do 
not occur. We will make use of the following rules for 
finite-dimensional Hermitian matrices A, B, C , and D: 



(i)ImTr[ABCP] = lmTr[DABC] 



(ii)ImTr[ABCP] = -lmTi[DCBA] 



(iii)ImTr[AB] = 



(C12) 



(C13) 



(C14) 



and, if any two of the matrices A, P, and C commute, 

(iv)ImTr[ABC] = 0. (C15) 

Rules (i) and (ii) result from elementary properties of the 
trace. Rule (iii) is a consequence of (i) and (ii), and rule 
(iv) follows from (iii). Replacing the first P in Eq. (|C11[) 
by 1 — Q and applying rule (iv) to the term containing 
1 ([x,y] = and PHP is Hermitian), we obtain 

(AM)* = -471m Tr [PHPxQy], (C16) 

which has the desired form. 

Now we invoke Wannier-rcprcscntability to write 

Tr[PHPxQy] = £ ( Wj \HP(x - x 3 )Q{y - y )\ W] ) 

( C1? ) 

(note that PfQ = 0). Since only the relative coordinate 
appears, the contribution from the surface orbitals is non- 
extensive, vanishing in the thermodynamic limit. We are 
then left with a bulk-like expression: 

Tr[PPPiQy] -► ^ ^ (Rm\H\R'n) (R'n\xQy\Rm) . 

Rm R/n 

(Ci8) 

Both matrix elements on the right-hand-side depend on 
R and R' only through R' — R, and therefore, comparing 
with Eq. (|C4| . 



— Tr[PHPxQy] tr c [PHPxQy] , (C19) 



where iV c is the number of crystalline cells in the sam- 
ple. Combining Eqs. ([C7| . (|C16|) and (|C19j) one obtains 
Eq. (|T7|) . which concludes the proof. 
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